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Description 
Related Applications 

5 "'"his ippiicntion claims the benefit of u 3 P'ovisional Application No 60 015 502 Med May 3 i GG 6 

Background of the Invention 

The generation or destruction of '^ssug requires constant reorganization anc restr uctunng of the extracellular matrix 

■0 ECM) components including interstitial ccllaqens basement membrane collagen tibrcnectin :aminin aggrecan and 
various proteoglycans Hemegard and Oldbei g FASEBJ 1989 3 2042-2051 Woessner FASEBJ 1991 5 214-2154 
Normal types of remocelinq orocesses include embryonic development post-partum involution of the uterus ovulation 
wound healing and bone and growth plate remodeling Woessner et al Steroids 1989 54 491-499 Weeks et al 
Biochim Biophys Ac'a 1976 445 205-214 Lepage and Gache EMBO J 1990 9 3003-3012 Wnde and Sanders Dcv- 

J 5 Dyn 1993 1 Q S( 3) 225-39 Similar processes also occur in disease states such as joint destruction in rheumatoid and 
osteoarthritis periodontia and tumc cell metastas s Thompson and Oegema J Bone Joint Surg 1979 61 407 16 
Reynolds et al Ad\ -Dent-Res 1994 5(2)312-9 One example of these processes is the migration of macrophages to 
the site of inflammation as in the case of synovial tissue in rheumatoid arthrit s Cutolo et al Clin and Expcr Rheum 
1993 1 1 331-339 The EC VI components are icgjiated m both normal and disease states by various exogenous 

20 and endogenous factors For example in turnoi formation the differentiation state ot the ceil can increase the rate of 
degradation of the ECM Benya Patnot Immunopathol Res 1988 7 51-54 Likewise the presence of metalloprotei- 
nases or their inhibitors can alter the composition of the ECM An imbalance o f metalloproteinases and tissue inhibitors 
of matrix metaHoprotetnases (TIMP) has been shown to contribute to the pathogenesis of osteoarthritis Dean et al J 
Chn Invest 1989 54 675-685 Cytokines growth factors and the extracellular environment can all contribute to the 

25 alteration of the ECM Tyler Btochem J 1985 227 569-578 Dinarello Sem Immunol 1992 4 133-145 McConnell et 
al J Cell Biol 1987 105 1057-98 

The growth of cartilage and bone is actualized by cells such as articular chondrocytes and osteoblasts The main 
t'jncton of these cells -n immature 'issue is the deposition and remoceling of the cartilage or bone matrix In adult 
' ssue these cells maintain this matrix in order to ensure its proper function In both cases this encompasses secretion 

50 of the extracellula r components as well as secretion of proteins mvolveo in the turnover of the ECM 

A major species of protein secreted by these cells and involved m the turnover of the ECM are the metalloprotei- 
nases Woessner FASEB J 1991 5 214-21 54 A new type of secretory glycoprotein has also been identified in human 
cartilage osteoblasts synovial cells sheep and bovine oviduct and mammary cells and macrophages Nyrikos and 
Golds Biochem J 1990 268. 265-268 Hakala et al J Biol Chem 1993. 268(34) 25503-25810 Johansen et al J 

35 Bone and Mm Res 1992 7(5)501-511 Rejman and Hurley Biochem Biophys Res Commu n 1988 150 329-334 
DeSouza and Murray Endocrinology 1995. 1 36(6) 2485-2496 Hollak et al J Clin Invest 1994 93 1 288-92 Anas et 
at Biol, of Reproduction 1994 51 685-694 These novel mammalian proteins all share regions of significant homology 
to the bacterial ana *ungal chitinases and therefore are referred to herein as "chitmase-hke" proteins Chitinases a-e 
en/ymes that hydroiyze glycoside bonds They bear a subtle similarity to lysozymes 'rem mammals and function as 

■JO endoglycosidases with a sceciticity for N-acetyl-glucosamine linkages However these types of chitin-like structures 
homopolymers of N-acetyl-qlucosamme are not normally encountered in mammalian tissue 

The human cartilage glycoprotein. HC gp-39 is a orotem with an apparent molecular weight of approximately 39 
kDa secreted by both articular chondrocytes and synovial fibroblasts Nyr-kos and Golds Biochem J 1990 268 
265-268 Hakala et al J Biol Chem 1993 268(34) 25803-25810 This protein has been described as a marker for 

4 5 joint injury appearing in the blood and synovial fluid from patients diagnosed with rheumatcd arthritis Johansen et a : 
British J of Rheumatology 1993 32 949-955 The gene encoding this protein has oeen cloned and is expressed 
specifically in cartilage and synoval ce;ls of rheumatic joints Hakala et al J Biol Chem 1993 268(34) 25803-25810 
The protein YKL-40 has aiso been identified as one of the major secretory products of cu'tured human osteoblastic 
cells (osteocarcinoma cell line MG-63j expressed in response to 1 25-dihydroxyvitamtn D3 stimulation Johansen et 

50 al J Bone and Mm Res 1992 7(5, 501-511 Johansen et al Br J Rheumatol 1993 32 949-55 The N-termm^l 
portion of YKL-40 was sequenced ana found to be identical to HC gp-39 Upon further sequencing YKL-40 and HC 
gp-39 were found to be identical 

Chitotriosidase -s an en/yme which has been identified as a member of this "chitmase-liKe" family Renkema et 
al J Biol Chem 1995 27C 2195-2202 Hollak et al J Clin Invest 1994 93 1288-92 This protein also has an 

55 apparent molecular weight of 39 kDa and shares N-termmal homologies with HC gp-39 the bovine mammary protein 
and several bacter al chitinases Activity of this enzyme was originally detected from cells of patients afflicted wr.h 
Gaucher Disease < 3D) Gaucher Disease is an inherited deficiency in the activity of glucocerebrosidase a lysosomal 
hydrolase This defect results in an accumulation of glucosylceramide ! glucocerebroside ) in the lysosomes of macro- 
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phages Accumulation cf hpid-laden macrophages results in hopatosplenomcgaly bone lesions and neurological 
anomalies After morphological differentiation of monocytes into macrophages in culture the cells begin to produce 
and secrete increasing amounts of chitotnosidase This increase is on average 600 times greater in GD patients than 
in parents with other pathological conditions The elevation m chitotnosidase activity can be effectively reduced how- 

5 ever upon initiation of en/yme supplementation therapy Unlike the other members of the chitinase-like family chitot- 
nosidase has chitolytic activity Like the bacterial enzyme it has the abi ity to degrade chitin a/ure a polymer of beta- 
1-4-linked N-acetylglucosamine moieties 

A new lymphocyte-associated protein of the chitinase-like family referred tc as HC gp-39L has now been identified 
HC-gp39L protien is believed to be involved in tissue remodeling in the mammalian cell and thus serve as useful tools 

j o m the development of therapeutics and diagnostics for tissue remodeling disorders 

Summary of the Invention 

In accordance with one aspect of the present invention there is provided a novel mature polypeptide which is HC 
'5 gp-39L as well as biologically active and diagnostically or therapeutically useful fragments analogs and derivatives 
'hereof The polypeptide of the present invention is of human origin 

In accordance with another aspect of the present invention '.here are provided isolated nucleic acid molecules 
encoding human HC gp-39L including mRNAs CNAs cDNAs genomic DNAs as we I as analogs and biologically 
active and diagnostically or therapeutically useful fragments thereof 
20 In accordance with yet a further aspect of the present invention there is provided a orocess for producing HC gp- 

39L by recombinant techniques comprising culturmg recombinant prokaryotic and/or eukaryotic host cells containing 
a human nucleic acid sequence ot HC gp- 39L jnder conditions promoting expression of these proteins and subsequent 
r ecovery of these proteins 

In accordance with yet a further aspect of the present invention there is provided a process for utilizing HC gp- 
-5 39L polypeotide. for therapeutic purposes for example in the treatment of tissue remodeling disorders 

In accordance with yet a further aspect of the present invention there is provided a process for targeting HC- gp- 
39 L located on the surface of lympnocytes with specific antibodies for therapeutic purposes such as the treatment of 
autoimmune or tissue remodeling disorders 

In accordance with yet a further aspect of the present invention there «s orovidod a process for utilizing HC gp- 
30 39|_ polypeptide for diagnostic purposes for example in the diagnosis of tissue remodeling disorders 

These and other aspects of the present invention should be apparent to those skilled in the art from, the teachings 
herein 

Brief Description ot the Drawings 

F'gure 1 provides the polynucleotide coding sequence (SEQ ID NO 1 ) and deduced amino acid sequence (SEQ 
ID NO: 2) of the polypeptide HC gp-39L 

Detailed Description of the Invention 

In accordance with one aspect of the present invention there is provided an isolated nucleic acid sequence (SEQ 
C NO 1 ) which encodes the chitirase-hke protein HC gp-39L (HC gp-39 like} The deduced amino acid sequence of 

this protein (SEQ ID NO 2) is shown in Figure 1 Further HC gp-39L has seen isolated from :ymphocytes and has 

been found to share 5* n <> amino acid identity (6S°o ammo acid similarity) witn the cmrmase-liKe protein HC gp-39. 
ncluding the conserved c h it in -ise-hke domains and cysteine motif However HC- gp-3 G L differs from the other chitmase- 

1 ike protems in that the cDNA extends past the homology with HC gp-39 and cnitotriosidase producing an N-terminal 

e <tension 

The nucleic acid sequenc cs of the presort invention may be in the form of RNA o r ir the form of DN A which DN A 
ircludescCNA genomic DNA and synthetic DNA The DNA may be double -stranded or single-stranded and it sing e 
50 stranded may be the coding strand or ncn-coding ;anti-sense) strand The ceding sequence which encodes HC- gp- 
3 -:, L p'Otcm may be identical 'o the coding sequence shown in Figure 1 or mav be a different coding sequence which 
coamq sequence as a result of the redundancy or degeneracy of the genetic code encodes the same protein as the 
DNA of Figure 1 

The present invention includes po'ynucleotides encoding the same mature polypeptide as shown in Figure 1 
^ Further the inventions includes variants of such polynucleotides that encode a fragment derivative or analog of the 
polypeptides of Figure 1 Among variants in this regard are variants that differ from the aforement oned polynucleotides 
by nucleotide substitutions deletions or additions The substitutions deletions or additions may involve one or more 
nucleotides The variants may dg altered in coding or non-ceding regions or both Alterations in the coding regions 
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occur nnturaiiy As nerem -iocvg 'ncjc-itcc t^o ooiy°uciectide m.-iy "nvo i ceding sequence Ahicr. s ,-i nrituinlly oc- 
curring .-niche Vri'.nrM of :ho coding sequences shewn n F-qure i As Known m [he nrt -in n.ionc variant >s an alternate 

^ form of a polynucleotide sequences wnicn m^y hnvo a substitution deletion :r -jddit'O^ o f o n e or more nucleotides 

Among the paM-cuiany preferoa embodiments c f the .n venticr^s m th^s regard nre polynucleotides encoa ng 
polypeptides having ! .he drrnno acid sequences of HC qp-3 G L set out in Figure 1 variants analogs derivatives r-md 
fragments thereof and fragments of 'he variants analogs and derivatives 

Furthe r particula r ly preferred in this regard are polynucleotides encoding hZ gp-39L van ants analogs derivatives 

~o and fragments and variants analogs and derivatives of the fragments which have the ammo acid sequence of HC 
go-39L polypeptide of Figure 1 in which several a few 5 to 10 1 to 5 1 to 3 2 1 or no ammo acid residues are 
substituted deleted or added in any :ombination Especially prefer red among these are silent substitutions additions 
and deletions which do not alter the properties and activities of HC gp-3°L Also especially preferred in this regard 
a r e conservative substitutions most highly preferred are polypeptides having the amino acid sequence of Figure 1 

7 $ without substitutions 

Further preferred embodiments of the invention are polynucleotides that are more than 65°o identical to a polynu- 
cleotide encocing HC gp-39L polypeptide havincj the amino acid sequence set out in Figure 1 or variants close ho- 
mologs derivatives and analogs thereof as described above Alternatively most highly preferred are polynucleotides 
that comprise a r egic-n that ;s more than 55% identical to a polynucleotide encoding HC qp-39L polypeptide of Figure 
~o 1 in this rega r d polynucleotides more than °0°o identical to the same are particularly preferred and among these 
particularly preferred polynucleotides those with 95°o or more identity are especially preferred Furthermore those 
with 97°o or more identity are highly preferred among those with 95°» or more identity and among these those with 
98°o or more and 99°o or more identity are particularly highly preferred with 99°o or more being the more preferred 
Also particularly preferred in this regard are polynucleotides encoding a polypeptide having the amino acid se- 
-5 quence of HC gp-39L set out in Figure 1 As set out elsewhere herein the polynucleotide may encode the polypeptide 
I a continuous region or m a plurality of two or more discontmuous exons and it may comprise additional regions as 
well which are unrelated to the coding region or regions 

Most highly preferred m this regard are polynucleotides '.hat comprise a region that is more than 85% identical to 
the HC gp-39L-encodmg portion of the polynucelotice set out m Figure 1 Alternatively most highly preferred are poly- 
pe nucleotides that comprise a region that is more than 85% identical to the HC gp-39L-encoding portion set forth in Figure 
1 Among such polynucleotides those more than 90° c identical to the same are particularly preferred and among 
these particularly preferred polynucleotides, those with 95°<> or more identity are especially preferred Furthermore 
those with 37% or more identity arc highly prefenea among those with 95°o or more identity and among these those 
with 9B°o or more and 99°o or more identity are particularly highly preferred with 99% or more being the more preferred 
35 of these. 

The present invention also includes poiynuc'ectides in which the sequence encoding the mature polypeptide is 
fused in the same reading frame to additional sequences Such sequences include signal sequences which facilitate 
transport of the nascent protein into the endoplasmic reticulum pro-sequences that are associated with inactive pre- 
cursor forms of the polypeptide which may facilitate trafficking of the protein m a cell or out of a cell or may improve 

~>o persistence of the protein m a cell or in an extracellular compartment Such sequences also may be added to facilitate 
production and purification or to add additional functional domains as discussed elsewhere herein 

A further preferred emoodiment is an HC gp-39L polynucleotide sequence comprising a Met codon upstream (5') 
of the 5' sequence in SEQ ID 1 It is most preferred that the Met codon be part of a continuing open reading frame as 
that encoding the polypeptide of Figure 1 (SEQ ID 2) Using the sequence of HC gp-39L of SEQ ID NO 1 skilled artisans 

J ^ can obtain such upstream or other upstream sequences by making primers to perform amplification using routine 
methods Preferred primers useful to derive upstream sequences are obtained from the sequence 



5'-cagcacctgtggctggggagcccagatgaagtgtggctctatcttgtaTGTGAG 

CACACCCACATTTTCACTGCCATTATCTGGGACAGCagaiiccaggtttggctcaacagat-3' 

Fragments of this sequence are also useful preferably fragments of about 15-50 nucleotides in length and most 
preferably 20-25 nuc eotides in length 

The present invention further relates to the HC gp-39L polypeptide which has the deduced ammo acid sequence 
:SEQ ID NO 2) of Figure 1 as well as fragments anaiogs and derivatives of such polypeptide 

The terms "fragment " "derivative" and "analog" when referring to the polypeptide of Figure 1 means a polypeptide 
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which retains essentially the same biological function or activity ds such polypeptide Thus ho analog includes a pro- 
protein which can be activated by cleavage of the proprotein portion to produce an active mature polypeptide The 
polypeptide of the present invention may be a recombinant polypeptide a natuial polyp cptide or a synthetic polypeptide 
preferably a recombinant polypeptide 

The fragment der ivative or analog of the polypeptide of Figure 1 may be ■: r> one in which one or more of the ammo 
acid residues are substituted with a conserved or non-conserved amino acid residue 'preferably a conserved am no 
acid residue) and such substituted amino acid r esidue may or may not be one encoded by the genetic code or mi) one 
in which one or more of the amino acia residues includes a substituent group cr \ in . one in which the mature polypeptide 
is fused with another compound such as a compound to increase the half-li f e of the poypeotide (for example poly- 
ethylene glycol) or (iv) one in which the additional amino acids are fused to tne mature polypeptide sucn as a leader 
or secretory sequence or a sequence which >s employed for purfication of the mature polypeptide or a proprotein 
sequence Such fragments derivatives and analogs a r e deemed to be within the scope of those skilled in the art from 
tne teachings herein 

The polypeptide and nucleic acid sequences of the present invention are preferably provided in an isolated formi 
and preferably are purified to homogeneity The term "isolated" means that the materia' is removed from its original 
environment (e g the natura. environment if A is naturally occurring) For example a naturally-occumng nucleic acid 
sequence cr polypeptide present in a living animal is not isolated but the same nucleic acid seguence o r polypeptide 
separated from some or all of ;he coexisting materials in the natural system is isolated Such nucleic acid sequences 
could be part of a vector and/or such nucleic acid sequences or polypeptides couic be cart of a composition and still 
be isolated in that such vector or composition is not part of its natural environment 

The present invention also relates to vectors which include one of the nucleic acic sequences ot the presert 
■nventicn. host cells wtveh are genetically engineered with vectors of the invention ^nd the production of polypeptides 
cf the invention oy recombinant techniques 

Host colls a^e genetically engineered (transduced or transformed or transfected) with the vectors of this -nventicn 
which may be. for example a cloning vector or ar expression vector The vector may be for example in the form of 
a plasmid a viral particle a phage, etc The engineered host eels can be cultured in conventional nu'nent meda 
modified as appropriate for activating promoters selecting transformants or amplifying HC gp-39L cene The culture 
conditions such as temperature pH and the like are those previously used with the host cell selected \o r expression 
and will be apparent to the ordinarily skilled artisan 

The nucleic actd sequences of the present invention may be employed for producing polypeptides by recombinant 
techniques The nucleic: acid sequences may be included in any one of a variety of expression vectors for expressing 
a polypeptide Such vectors include chromosomal nonchromosomal and synthetic ON A sequences e g doriva'ives 
cf SV4C bacterial plasmids phage DNA baculovirus' yeast plasmids vectors derived from combinations of plasmids 
and phage DNA viral DNA such as vaccinia adenovirus fow, pox virus and pseudorab.es However any other vector 
may be used as long as it is replicable and viable in the host 

The DNA sequence of HC gp-39L can be inserted into the vector by a variety of procedures In general the DNA 
sequence is inserted into an appropriate restriction endonuclease siteis) by procedures known in the art Such proce- 
dures and others are deemed to be within the scope of those skilled in the art 

The DNA sequence in the expression vector .s operatively linked to an appropriate expression control sequence 
■si (promoter) to direct mRNA synthesis As representative examples of such prcmioters there may be mentioned 
LTR or SV4Q promoter the E coh lac or trp. tne phage lambda P L promoter and other promoters known to control 
expression of genes in prokaryot c or eukaryotic cells or their viruses The expression vector also ccntams a r ibesome 
binding site fo r translation initiation and a transcription terminator The vector may a^so include appropriate sequences 
for amplifying expression 

In adcition the expression vectors preferably contain one or more selectable marker genes to provide a phenotypic 
trait for selection of transformed nost cells such as dihydrofolate reductase or neomycin resistance for eukaryotic ceil 
culture or such as tetracycline or ampicillm resistance in E coli 

Trie vectcr containing the appropriate DNA sequence as hereinabove described as we I as an appropriate pro- 
moter or control sequence may be employed to tr nnsform an appropriate host to perm it the nost to express the prcteir. 

As representative examples of app-opnate hosts there may be mentioned bacteria cells such as E cell. Strep- 
'omyecs. Salmonella (yphimunum fungal cells such as yeast insect cells such as Drosopfvla S2and Spodoptera Sf9: 
animal ceils sucn as (J HO COS or Bowes melanoma adenoviruses plant coils etc The selocticn cf an appropriate 
host -s deemed to be wit n m the scope cf those skilled in the art from the teachings herein 

More carticularly tne present invention also includes recombinant constructs comprising one or more of the se- 
quences as described above The constructs comprise a vector such as a plasmid or vira. vector into which a sequence 
encoding HC gp-39L has been inserted in a forward or reverse orientation In a preferred aspect of this embodiment 
the construct further comprises regulatory sequences including for example a promoter operably linked to the se- 
quence Large numbers of suitable vectors and promoters are known to those of skill in the art and are commercially 
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-:v-i '.-ib'o *c c/. -.Ovivc-i vo z-c\ cocj cy '^v J ex-i'^c o Bactc a, cub 7 J p.QE 60 pCc-- C ncje^ c3S 

cL^C en i ne- c ■ • > * ~~4 cc'uOSC"C-t S* oL^ks ;:\ M 5A cNH ■ - A c\H46A ■ Si rn Inge-no ■ c-Vc^-a 

C\>\.i23-J pr\r\23.~-3 pDF54G c-R|T5 F^am- ncia ■ E^aryot.c cWLNEC cSV2CATpCG44 pXT! p 3G . Stf alaqene , 
pSVK3 pBPV cM3G pSVL ■ Fharmaciai h:wovor riny other oiasmid c< vector r^ay be used ns : ong as tney -i t e 
■opticace and v;aoe in the nost 

Promoter region s Cnn co selected from any cjgs ; ? cci gene ^S:nq 2 AT .ch.o'amphenic : i transferase - vectors or other 
vectors w th select-idle barkers Two appropriate vectors are pKK232-9 a fid pCM7 Particular named bacter^i pro- 
moters include lac GcZ ^3 ^7 gpt : anbda p R P L anatrp Eukaryctie promoters include CMV immediate early HSV 
tnymidine Kinase early and iate SV40 L T Rs fiom retrovirus and mo.se meL-ilotnionem -i Selection of the appropriate 
vector and promoter is well within the level of ordinary skill in the art 

m a further embodiment the present invention relates to host rolls cortaming the above-descr bed constructs 
The host cell can ce a higher eukaryotie cell such as a mammalia 0 cell o r a lower eukaryotic cell such as a yeast 
ceil or the host ce I can be a prokaryotic cell such as a oactenal cell Introduction of the construct into the host cell 
can be effected by calcium phosphate transfection DE AE -C extran mediated transfection or electrcporation i Davis 
L Dibner M 3attey I Basic Methods in Molecular Biology, f 19860 

The constructs in host ceils can be used in a conventional manner to produce the gene product encoded by the 
recombinant sequence Alternatively pcypeptides can be synthetically produced by conventional peptide synthesizers 

Mature proteins can be expressed in mammalian cells yeast bacteria or other cells jnder the control of appro^ 
priate promoters "-ell-free transition systems can also be employed to produce such proteins using RN As derived 
from the DNA constructs of the p r esent invention Appropriate clon.ng and expression vectors for use with prokaryotic 
and eukaryo:ic hosts are described by Sambrook et al Molecular Cloning: A Laboratory Manual. Second Edition Cold 
Spring Harbor N v ( 1 99.9) the disclosure of which is hereby incorporated by reference 

Transcription of the DNA encoding a protein by higher eukaryotes is increased by msenng an enhancer sequence 
into the vector Enhancers are cis-acting elements of DNA usually about from 10 to 300 bp that act on a promoter to 
increase its Transcription Examples including the SV40 enhancer on the late side of the replication origin bo 130 to 
270 a cytomegalovirus early promoter enhancer the pciyoma enhancer on the late side of the replication origin and 
adenovirus enhancers 

Generally recombinant exp.-ession vectors will include origins of replication and selectable ma r kers permitting 
transtormation of the host cell eg the amp^cillm resistance gene of E colt and S cere vis tae TRP1 gene and a 
promoter derived Tom a highly-expressed gene to direct transcription of a downstream structural sequence Such 
promoters can be derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK) a- 
factor. acid phosphatase or heat shock proteins, among others The heterologous structural sequence is assembled 
in appropriate phase with translation imt-ation and termination sequences and preferably a leadei sequence capaoie 
of directing secretion of translated protein into the periplasms space or extracellular medium Optionally the heterol- 
ogous sequence can encode a fusion protein including an N-terminal identification peptide imparting desired charac- 
teristics, e g. stabilization or simplified purification of expressed recombinant product 

Useful expression vectors for bacterial use are constructed oy ■nserttng a structural DNA sequence encoding a 
desired protein together with su. table translator! initiation and term-nation signals in operable reading phase wirh a 
functional promoter The vector will comprise one or more phenotypic selectable markers and an origin of replication 
to ensure maintenance of the vector and to if desirable provide amplification within the host Suitable prokaryotic 
hosts for transformation include E colt. Bacillus suhtilis. Salmonella typhimunum and various species within the genera 
Pseudomonas. Streptomyces and Staphylococcus, although others may also be employed as a matter of choice 

As a representative but nonhmiting example useful expression vectors for oactenal use can comprise a selectable 
marker and bacterial origin of replication derived from commercially available plasmids ccmpnsmg genetic elements 
of the well known cloning vector pBR322 (ATCC 37017) Such commercial vectors include for example pKK223-3 
(Pharmacia Fine Chemicals Uppsala Sweden] and GEM1 (Promega Biotec Madison Wl USA) These pBR322 
"backbone" sections are combined with an appropriate promoter arid the structural sequence to be expressed 

Following transformation of a suitable host strain and growth of the host strain to an appropriate cell density the 
selected promoter is induced by appropriate means (e g temperature shift or chemical induction) and cells are cultured 
for an additional period 

Ce'ls are typically harvested by centrif ugation disrupted by physical or chemical means ind the resulting crude 
extract retained for further purification 

Microbial cells employed in expression of proteins can be disrupted by any convenient method mc udmg freeze- 
thaw cycling sonication mechanical disruption or use of cell iys'ng agents such metheds ire well know to those 
skilled in the art 

Vancus mammalian cell culture systems can also be employed to express recombinant proteins Examples of 
mammalian expression systems include the COS-7 lines of monkey kidney fibroblasts described oy Gluzman Cell 
1°S1 23 175 and other cell lines capable of expressing a compatible vector for example the C 127 3T3 CHG HeLa 
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and BhK cell tines Mammalian expression vectors will comprise an onqtn of r ophc^tion a suitable promoter' and en- 
hancer ind also nnv necessary ribosome binding sites polyadenyiation 3 to splice Jonor and acceptor sites tran- 
scriptional termination scguences and 5' Marking r Dntranscribod sequences DNA sequences iJerived from the SV40 
splice rind polyadenyiation sites may be used to provide the required nontr -inscribed genetic elements 

^ Polypeptides can then be recovered ana our idea from recombinant cell ;ultures bv methods including ammonium 

sulfate or" ethanol preciptation acid extraction anion or cation exchange chromatography ph csphocollu lose coroma- 
toqrapny hydrophobic interaction chromaiography iffmity chromatography hydroxylapatite chromatography and lectin 
chromiatog'aphy Protein r efolding steps car be useo as necessary in completing c 3n*iguratic>n of the mature pro'.em 
Finally high performance liquid chromatc grapny (HPLCj can be employed \o r final purifi:ation steps 

1 0 It s now believed that the cnitmase-hke proteins particularly HC gp-39L are involved in tissue remodeling HC gp- 

39 HC qp-39L and human chitotriosiasc -ire all homologous to microbial chit;nases in particular all three proteins 
share a similar cysteine motif and homology to the active site of microbial chitmases Accoramgly. these proteins can 
ce used in the development of '.reatments for 'issue r emcdeling diseases and in the d'agnosis of these diseases 

Based on the homology with HC gp-39 HC gp-39L is believed to be a product of mac r ophages Primary human 

? 5 -monocytes activated to become macrophages through adherence to plastic secrete high evels of HC gp-39 into the 
culture media In addition the induction of HC ^p-3° expression has been co r rela:ea wth the differentiation :>f myeloid 
coll lines HL60 and U937 toward a macrophage lineage by induction with chorbol es'er HC gp-39 message can be 
detected in human atherosclerotic plaques derived from endarterectomies Accordingly it is believed that HC gp-39 
I'unct'ons in various tissues undergoing extensive remodeling including those associated with rheumatoid arthritis and 

20 ,-uherosclerosis where macrophages oiay an important role Further rHC gp-39 was shown to stimulate in vitro smooth 
muscle ceil migration indicative of involvement in the remodeling processes occurring in diseased arteries 

The tissue-specific expression pattern cf HC; gp-39L has also been determined HC gp-39L is expressed specifically 
• n lymphocyte cell lines which is consistent wth tissues from which HC :jp-39L is found In addition human multiple 
tissue Northern blots show that HC gp-3°L message is found at the h ghes* levels in thymus and spleen which also 
consistent with lymphocyte-specific expression HC gp-39L protein was detected oy western blot using polyclonal 
antiserum multiple produced from rabbits mmuri/ed with a fusion construct containing a part a- HC gp-39L pro'em 
HC go- protein is associated with tine membrane fraction o- the ceils and is not secreted mtc the media Accordingly 
it is believed that HC gp-39L functions as a cell surface lymphocyte rriarker for activated lymphocytes HC gp-3 c L is 
therefore a useful '.arget 4 or inflammatory :>r t ssue remodeling diseases involving "-cell or B-cell activation sucn as 

30 -heumatoid arthritis 

HC gp-39L prcteins can also be used -n the production of antibodies The proteins their fragments or other deriv- 
at ves or analogs thereo 4 or cells expressing them can be used as an immunogen to produce antibodies thereto 
R ese antibodies can be for example po-yclcnal or monoclonal antibodies The present invention also incudes chi- 
merc smg;e chain and Humanized antibodies as well as Fab fragments or the pic duct of an Fab expression library 
Various procedures known in the art miay ce used for the production of such antibocies and fragments 

Antibodies genera:ed against these proteins can be obtained by direct inject on of the protein into an animal 
preferably a nonhuman 7he antibody so obtained wili then bind the protein itself In this manner even a sequence 
encoding only a fragment of the protein can be used to generate antibodies binding the whole native prote n Such 
antibodies can then be used to isolate these proteins from tissue expressing these proteins 
-0 Fo preparation of monoclonal anhboces any technique which provides antibodies produced by continuous cell 

mo c j :ures can be used Examples include the hyb' idoma technique ( Kohler and Mi'stein Nature 1 975 256 495-497 i 
:he tnoma technique the human B-coi hybndoma 'echmcue fKo/bor et a! immunology Today 1983 4 72 1 and the 
EBV-nybridoma technique to produce human monoclonal antibodies (Cole et al in Monoclonal Antibodies and Cancer 
Therapy. Alan R Liss :nc 1985 pp 77- -6) 

Techniques described for the production of single chain antibodies (U S Patent 4 946 775) can be adapted to 
produce smg:e chain antioodies to immunogenic HC gp-39L protein product Also transgenic mice may be usee to 
express humanized antibodies to immunogen c HC- qp-39L protein products 

Tl is invention is also related to the use of HC gp-3 G L as a diaqnosbc Defection of a mutated fc rms of HC gp-3 Q L 
wii allow a diagnosis of a tissue remodel ng disease such as meumato d an^ntis or atherosclerosis 
50 Individuals carrying mutations in cne jr more of these HC ap-3 Q L proteins may be detected at the DNA level oy 

a vHmr:y of techniGues Nucleic acids for :!iaqncsis may be obtained from a patients cells such as from blood urine 
sal va tissue biopsy and autopsy material The genomic DNA may be used directly for detection or may be ampiit ed 
en/yrr.aticaliy Oy using PC- R i Saiki et al N-iiuic 1 986 324 1 03-1 66) prior to analysis RNA or cDN A m^ay also be used 
tor the same purpose As an example P3R primes complementary to the nucleic: acid encoding HC gp-39L can be 
^ used to identify and analyze mutations in this p r o ? em For example deletions and insertions can be detected by a 
change in si/e of the amplified product m comparison to the normal genotype Pomt mutations can be identified by 
hybridi/mg amphfiec DNA to radiolabeled RNA or alternatively radiolabeled antisense DNA sequences Perfectly 
matched sequences can be distinguished from mismatched dup lexes by RNase A digestion or by differences in melting 
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- v : • " •"• ■ : .. :;o'v.oo^ : ,_ o ■•j'o'c-^jo .;ene .-i^-a genes -^av^c; -^..inters '^.-iy oo -ovonioj by ino air oct 

G\A so^O'-.rq method naai: on Coned DNA scgmor.is may go employed .-is probes to detect specific DNA 
sc gments "ho sensitivity of in i s method is greatly enhdncoa when comtmoa witn PGR For example a sequencing 

: " p'l-ncr s ..sed with double -stranded PGR product or a single -stranded template molecule generated by a modified 
FCR T ro soquor~.ee determination is performed cy :onventional procedures wr.h radiolabeled nue'ect'de or by auto- 
fT " -itic sequencing procedures with fluorescent-tags 

Genetic :estirg based on DNA secuence diffe r ences mdy be achieved by detection of alteration in electrophoretic 
mobility of DNA fragments m gels with or without denaturing agents Sma ! l sequence deletions and insertions can be 

'0 visualized by high resolution gel electrophoresis DNA fragments of different sequences may be distinguished on de- 
naturing formamide gradient gels in which the mobilities of different DNA fragments are retarded ;n the gel at different 
positions according to their specific melt, ng or partial melting temperatures tsee eg Myers et al Science 1985 230 
1142) Sequence changes at specific locations may also be revealed by nuclease protection assays such as RNase 
and S1 protection or the chemical cleavage method <e q Cctton et al PNAS USA 1985 55 43 Q 7-4401 

:S Thus tne detection of a specific DNA sequence may be achieved by methods such as hybridization RNase pro- 

tection chemical cleavage direct DNA sequencing or the use of restriction en/ymes (eg Restriction Fragment Length 
Polymorphisms (RFLPVi and Southern blotting of genomic DNA 

In riddrion to more conventional gel-elect r cphoresis ana DNA sequencing mutations can also be detected by in 
si'ij analysis 

~° The present invention also relates *o a diagnostic: assay for detecting altered levels of HC gp-39L protein in various 

tissues Assays used to detect levels of such protein in a sample derived from a host are well-known to those of skill 
in the an and include radioimmunoassays competitive-binding assays Western Blot analysis and preferably an ELISA 
assay An ELISA assay initially comprises preparing an antibody specific to a HC gp-39L protein preferably a mono- 
clonal antibody In addition a reporter antibody is prepared against the monoclonal antibody To the reporter antibody 
is attached a detectable reagent such as radioactivity fluorescence or m this examole a horseradish peroxidase en- 
zyme A sample is ■emoved from a host and incubated on a solid support e g a polystyrene dish, that binds the 
proteins in me sample Any free protein binding sites on the dish are then covered by incubating with a non-specific 
p'Otem like BSA Next the monoclonal antibody is incubated n the dish curing which time the monoclonal antibodies 
a:tach to HC gp-39L protein attaches to the polystyrene disn All unbound monoclonal antibody is washed out with 

oo buffer The reporter antibody linked to horseradish peroxidase is now placed in the dish resulting in binding of the 
reporter antibody to any monoclonal antibody bound to HC gp-39L Unattached reporter antibody is then washed out 
Peroxidase substrates are then added to the dish and the amount of color developed in a given time period is a meas- 
urement of the amount of HC gp-39L protein present in a given volume of patient sample when compared against a 
standard curve 

35 A competition assay tray be employed wherein antibodies specific to HC gp-39L protein are attached to a solid 

support and labeled protein and a sample derived from the host are passed over the solid support and the amount of 
label detected attached to the solid support can oe correlated to a quantity of HC gp-39L protein in the sample 
The following examples are provided for illustrative purposes only and are not intended to limit the invent on 

EXAMPLES 

Example 1 Expression of HC gp-39 in vitro 

Recomoinant HC gp-39 was produced in wf/o by transfectmg an expression vector containing the cDNA into CHO 

~5 cells and selecting stable cell lines 

The ful length HC gp-39 gene was cloned into CDN in two pieces a 660 bp Sac ll-Bst Ell fragment plus a 673 bp 
Bst E I J -Bel fragment ligated together with the CDN vector cut with Sac : I -Bel I This construct was transfected into 
GRO ACC 317 Cells by standard methods Specifically 20 mg of the HC gp-39 plasmid construct was linearized by 
restriction digestion and electroporated into 1 25 x 10 7 cell in 1 ml Cells were seeded at a density of 2 5 x 10 3 cells 

50 per well and selected in minimal media in the absence of nucleosides Secreted protein was recovered from the con- 
ditioned media and purified using Q sepharose flow through S sepharcse capture and si/ed on Suprose 12 The 
resulting material was greater than 95°o pure as determined by Coomassie blue staining HC gp-39L can also be 
expressed as described in this Example 1 

55 Example 2. Assay for smooth muscle migration 

Smooth muscle migration was measured in a chamber divided by a semi-permeable filter to which rHC gp-39 
(2 3-766 mg. ml) was bound on one side On the other side human fetal smooth muse le cells were cultured The extent 
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to which cells migrated into the filter whs monitored color imetr icaily 'Optical density; rHC qp-3 Q elicited a stronger 
rmq ration response in this assay than PDGF a known stimulator of smooth muscle in iq ration This assay in this Example 
1 can also be readily used for analysis of HC qp-3°L 

Example 3: Production of polyclonal antibodies generated against HC gp-39L 

A partial HC gp-39L protein has been expressed in E cell and used to generate polyclonal antiserum A 1461 bp 
Ndel-Xhol cDNA fragment of HC qp-39L was cloned in frame as a fusion with an N-terminal His tag in the Pet 16B 
vector system iNovagem These constructs were transformed into E coli through standard methods The cells were 
propagated lysed and the protein purified by nickel affinity chromatography The purified fusion protein was used to 
immunize rabbits for the production of polyclonal antiserum 

Example 4: HC gp-39L is associated with the membrane fraction of lymphocytes 

Polyclonal antiserum was used to detect HC gp-39L protein on Western blots Protein is detected in whole cell 
lysates of lymphocytes It is also detected in the membrane fraction but is not associated with the cytoplasmic fraction 
or secreted into the media HC gp-39L may function as a cell surface lymphocyte marker for a subset of activated 
lymphocytes (inflammatory or tissue remodeling diseases) and be detected by FACS analysis a secondary detection 
method known to those of skill in the art 
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s equence l : : :;g 

(11 GENERAL INFORMATION 

(i) APPLICANT: Kirkpatrick, Robert 
Rosenberg , Martin 

(li) TITLE OF THE INVENTION : Human Cartilage Glycoprotein 

(in) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

;A) ADDRESSEE : SmithKline Beecham Corporation 

!B) STREET: 7 39 Swedeland Road 

■,C) CITY: King of Prussia 

i'D) STATE: PA 

IE) COUNTRY: USA 

F] ZIP: 19406-0939 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 
<C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER : 60/016,532 

(B) FILING DATE: 03-MAY-1996 



(viii) ATTORNEY /AGENT INFORMATION : 

(A) NAME: Han, William T 

(B) REGISTRATION NUMBER : 5219 

(C) REFERENCE /DOCKET NUMBER: P50390 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-5219 

( B ) T ELEFAX : 610-270-5090 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

SO <xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CAGCACCTGT GGCTGGGGAG CCCAGATGAA GTGTGGCTCT ATCTTGTATG TGAGCACACC 6 0 

CACATTTTCA CTGCCATTAT CTGGGACAGC AGAACCAGGT TTGGCTCAAC AGATTTCTCT 12 0 

TTCCACCCAT CTATTGCAGG TGTAGTGGTC TTGCTGCTTC TCCAGGGAGG ATCTGCCTAC 180 

AAACTGGTTT GCTACTTTAC CAACTGGTCC CAGGACCGGC AGGAACCAGG AAAATTCACC 24 0 

CCTGAGAATA TTGACCCCTT CCTATGCTCT CATCTCATCT ATTCATTCGC CAGCATCGAA 3 00 

AACAACAAGG TTATCATCAA GGACAAGAGT GAAGTGATGC TCTACCAGAC CATCAACAGT 36 0 
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CTC AAAACCA 
TCCAAAGGGT 
ATAATCCTGT 
GATCAGAAAG 
AAGGACTTCA 
AGGCAAATGA 
AACCTCCTGT 
AGCCCTCTGA 
GCTGTGGGGT 
ACATATGGGC 
GGCCCTGGAG 
TGCCAGTTCC 
GTCAAGGGGA 
TTCTTAAAGA 
ACTGGCAAAT 
GGCTCCCTGT 
CTGCTCTCTC 
TCTTCCAAGC 
GCAGCTGTTG 



AGAATCCCAA 
TCCACCCTAT 
TTCTGAGGAA 
AAAACACTCA 
CAAAATCCAC 
TTGATAACAG 
CCTTTGACTT 
GCAAGGGGTG 
ACTGGATACA 
ACTCCTTCAC 
CTGCTGGACC 
TGAAAGGAGC 
ACCAGTGGGT 
ATTTAAACCT 
CCTGCAACCA 
GAAGGATTAA 
CCAGGAATTC 
CTTTCCTGAC 
ACTTGTTGCC 



ACTGAAAATT 
GGTGGATTCT 
CCATAACTTT 
TTTCACTGTG 
CAAGGAAAGG 
CTATCAAGTT 
CCATGGGTCT 
GCAGGACAGA 
TAAGGGAATG 
ACTGGCCTCT 
CATCACAGAG 
CAAGATCACG 
GGGCTATGAT 
GGGAGGAGCC 
GGGCCCTTAC 
CTTACAGAGA 
TCATGTGGGA 
TTCCTCTTAG 
CTGAAGTACA 



CTCTTGTCCA 
TCTACATCAC 
GATGGACTGG 
CTGATTCATG 
CTTCTCTTGA 
GAGAAACTGG 
TGGGAAAAGC 
GGGCCAAGCT 
CCATGAGAGA 
GCAGAAACCA 
TCTTCAGGCT 
CGGCTCCAGG 
GATGTGAAGA 
ATGATCTGGT 
CCTCTTGTCC 
AGCAGGCAAG 
TTCCCCTTGC 
ATCATAGATT 
ATAAAAAAAA 



TTGGAGGGTA 
GCTTGGAATT 
ATGTAAGCTG 
AGTTAGCAGA 
CTGCGGGCGT 
CAAAAGATCT 
CCGTTATCAC 
CCTACTACAA 
AGGTGGTCAT 
CCGTGGGGGC 
TCCTGGCCTA 
ATCAGCAGGT 
GTATGGAGAC 
CTATTGACAT 
AAGCAGTCAA 
ATGACCTTGC 
CAGGATGGCC 
GGACCTGGTT 
TTCATTTTGC 



CCTGTTTGGT 
CATTAACTCC 
GATCTACCCA 
AGCCTTTCAG 
ATCTGCAGGG 
GGATTTCATC 
TGGCCACAAC 
TGTGGAATAT 
GGGCATCCCC 
CGCTGCCTCT 
TTATGAGATC 
TCCCTACGCA 
CAAGGTTGAG 
GGATGACTTC 
GAGAAGCCTT 
TGCCTGGGGC 
TTTGGATCTC 
TTGTTTTCCT 
TCCAGT 



420 
460 
5-10 
6C0 
660 
7 2C 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 

13 80 
1440 

14 9 6 



(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 423 amino acids 

(B) TYPE: airino acid 

(C) STRANDEDNSSS: single 

(D) TOPOLOGY: linear 



{li) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
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Claims 

1. A nucleic acid sequence encoding HC gp-39L 

2. A protein encoded by the a nucleic acid sequence comprising the sequence of claim 1 

3. A nucleic acid sequence comprising the sequence of SEQ ID NO 1 

4. A method of diagnosing a tissue remodeling disorder related to expression of a mutated HC gp-39L protein in a 
host comprising: 

in') synthesizing PGR primers complementary to a nucleic acid encoding a normal HC gp-39L protein 

(b) obtaining nucleic acids from cells of a host 

(c) amplifying the obtained nucleic acids with the synthesized PGR primers to obtains an amplified product and 

(d) comparing the amplified product to a normal genotype encoding the normal HC gp-39L protein to detect 
any mutations in the amplified product 

5. The method of claim 4 wherein the HC gp-39L comprises the polypeptide sequence of SEQ ID NO 1 

6. A method of detecting altered expression of a HC gp-39L protein in a host comprising 

(a) preparing a first antibody specific to a HC gp-39L protein 

(b) preparing a reporter antibody against the first antibody said reported antibody linked to an attached de- 
tectable reagent 

!c) obtaining a sample from a host 

(d) incubating said sample on a solid support which binds any proteins in said sample 

ie) incubating said first antibody with any bound protein on said solid support so that said first antibody attaches 
to any HC gp-39L proteins bound to said solid support 

if) contacting said reporter antibody linked to a detectable reagent with said attached first antibody on said 
solid support 

igj detecting said detectable reagent and 

(hi determining an amount of HC gp-39L protein \n said sampie 

7. The method of claim 6 wherein the HC gp-39L comprises the polypeptide sequence of SEQ ID NO 1 
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